CLAIMS 

What is claimed is: 

1. A method comprising: 

extracting at least one visual feature from a document, the document 
having a plurality of pages; 

ranking the pages in a document based on the at least one visual feature; 

and 

display pages based on ranking. 

2. The method of claim 1 wherein a plurality of visual features are 
used in ranking and at least one visual feature is weighted. 

3. The method of claim 1 wherein the visual feature is one of a 
picture, a text block, a character size, a character style, and a color 

4. The method of claim 1 wherein a visual feature is weighted based 
on gradations of the visual feature. 

5. The method of claim 1 wherein a visual feature is weighted based 
on location of the visual feature on a page. 
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6. The method of claim 1 wherein the visual feature is represented in 
a vector form. 

7. The method of claim 1 wherein the visual feature is used as a 
distance measure between a first document and a second document. 

8. The method of claim 1, further comprising clustering of a plurality 
of pages within a document. 

9. The method of claim 1, further comprising using visual features to 
reveal a transition from a first page to a second page of a document. 

10. The method of claim 1, wherein ranking of the pages includes a 
correction mechaiusm. 

11. The method of claim 1, wherein a scheme showing one of a 
plurality of pages in a docviment and a plurality of documents is by one of a 
linear display, a line of icons, and as a stack. 

12. A computer system comprising: 
a display; 

a processor coupled to the display; and 
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a memory coupled to the processor and having stored therein a routine, 
which when executed by the processor, causes the processor to generate display 
data through: 

extracting at least one visual feature from a document, the 
document having a plurality of pages, 

ranking the pages in the docvmient according to the at least one 
visual feature, 

selecting a page for representing a document according to a rank, 

and 

displaying the selected page as the display data. 

13. The computer system of claim 12 wherein a plurality of visual 
features are used in ranking and at least one visual feature is weighted. 

14. The computer system of claim 13 wherein the visual feature is one 
of a picture, a text block, a character size, a character style, and a color. 

15. The computer system of claim 13 wherein a visual feature is 
weighted based upon gradations of the visual feature. 

16. The computer system of claim 13 wherein the visual feature is 
represented in a vector form. 
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17. The computer system of claim 13 wherein the visual feature is used 
as a distance measure between a first document and a second document. 



18. The computer system of claim 12, wherein generating display data 
further comprises clustering of a plurality of pages within a document. 

19. The computer system of claim 12, wherein a plurality of pages are 
selected and generating display data further comprises using visual features to 
reveal a transition from a first page to a second page of a document. 



The computer system of claim 12, wherein ranking the pages 
includes a correction mechanism. 



^,^22: The computer system of claim 12, wherein a scheme showing one 
of a plurality of pages in a document and a plurality of documents is by one of a 
linear display, a line of icons, and as a stack. 



storage media containing executable program instructions which when executed 
by a digital processing system cause the digital processing system to: 

extract at least one visual feature from a document, the document having 
a plurality of pages. 



^0- 






An article of manufacture having at least one machine readable 
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6 rank pages in the document based on said at least one visual feature, 

7 select the pages for representing a document based on ranking, and 

8 display selected pages. 

1 v^fT The machine readable storage media of claim 23, wherein a 

2 plurality of visual features are used in ranking and at least one visual feature is 

3 weighted. 

1 J)^. The machine readable storage media of claim 23, wherein the visual 

2 feature is one of a picture, a text block, a character size, a character style, and a 

3 color. 

1 yZo. The machine readable storage media of claim 25, wherein a visual 

2 feature is weighted based upon gradations of the visual feature. 

1 27. The machine readable storage media of claim 23, wherein the visual 

2 feature is represented in a vector form. 

1 The machine readable storage media of claim 23, wherein the visual 

2 feature is used as a distance measure between a first document and a second 

3 document. 
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1 .^-^^ machine readable storage media of claim 23, further 

2 comprising clustering of a plurality of pages within a document. 

1 The machine readable storage media of claim 23, further 

2 comprising using visual features to reveal a transition from a first page to a 

3 second page of a document. 

1 The machine readable storage media of claim 23, wherein ranking 

2 the pages includes a correction mechanism. 



m 1 



The machine readable storage media of claim 23, wherein a scheme 

2 showing one of a plurality of pages in a document and a plurality of documents 

3 is by one of a linear display, a line of icons, and as a stack. 



1 ^34. A method comprising: 



2 extracting at least one visual feature from a document, the document 

3 having a plurality of pages; 

4 grouping pages in a docimient using a plurality of visual features; 

5 selecting representative pages from groups; and 

6 representing the pages. 
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